Context-aware semantic classification of search queries for browsing community question-answering archives

نویسندگان

  • Alejandro Figueroa
  • Günter Neumann
چکیده

Community Question-Answering (cQA) platforms have become massive repositories of user-generated content. To a great extent, these archives have proven to be highly re-usable. For instance, web search engines profit from their best answers for enhancing user experience when resolving question-like queries. Hence, considerable research efforts have gone into trying to revitalize and retrieve past answers contained in these archives. However, similarly to traditional web search, there is a linguistic gap between cQA questions and question-like search queries that are utilized for fetching information from these cQA repositories (e.g., “rib pain after ovulation” and “iron oxide household”). In fact, this gap does not only consider linguistic features, but also structural and social attributes. On the one hand side, cQA questions are long-winded, they can bear a title and a body, and community members are compelled to categorize questions at posting time. On the other hand side, search queries come as an uncategorized short stream of words. Moreover, in juxtaposition to cQA question, users typically submit streaks of semantically related search queries, when attempting to fulfil their information needs. This work digs deep into effectively exploiting semantic cues, yielded by preceding queries within the same user session, for classifying question-like search queries into twenty-six semantic cQA question categories. In order to find significant discriminative properties, we carried out experiments on a large-scale dataset acquired automatically. Broadly speaking, our results indicate that more effective semantic features can be computed as long as we account for a larger number of previous queries. In particular, facilitating Explicit Semantic Analysis for modelling the query context shows to be extremely helpful for increasing the classification rate.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatically Generating Questions from Queries for Community-based Question Answering

This paper proposes a method that automatically generates questions from queries for community-based question answering (cQA) services. Our query-to-question generation model is built upon templates induced from search engine query logs. In detail, we first extract pairs of queries and user-clicked questions from query logs, with which we induce question generation templates. Then, when a new q...

متن کامل

Building a Question Classifier for a TREC-Style Question Answering System

We define Question Classification (QC) here to be the task that, given a question, maps it to one of k classes, which provide a semantic constraint on the sought-after answer [Li02]. The topic of Question Classification arises in the area of automated question-answering systems, such as those created for the TREC question answering competition. Automated question-answering systems differ from o...

متن کامل

Modeling Community Question-Answering Archives

Community Question Answering (CQA) services contain large archives of previously asked questions and their answers. We present a statistical topic model for modeling Question-Answering archives. The model explicitly captures relationships between questions and their answers by modeling topical dependencies. We show that the model achieves improved performance in retrieving the correct answer fo...

متن کامل

AquaLog: An Ontology-Portable Question Answering System for the Semantic Web

As semantic markup becomes ubiquitous, it will become important to be able to ask queries and obtain answers, using natural language (NL) expressions, rather than the keyword-based retrieval mechanisms used by the current search engines. AquaLog is a portable question-answering system which takes queries expressed in natural language and an ontology as input and returns answers drawn from the a...

متن کامل

Learning to Rank Questions for Community Question Answering with Ranking SVM

This paper presents our method to retrieve relevant queries given a new question in the context of Discovery Challenge: Learning to Re-Ranking Questions for Community Question Answering competition. In order to do that, a set of learning to rank methods was investigated to select an appropriate method. The selected method was optimized on training data by using a search strategy. After optimizi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Knowl.-Based Syst.

دوره 96  شماره 

صفحات  -

تاریخ انتشار 2016